Corpus: mwl_wikipedia_2021_30K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
Stados Ounidos 380 283 277 1.40
Nuossa Senhora 32 36 29 1.37
Buenos Aires 22 29 22 1.32
Países Baixos 29 29 28 1.07
Van Gogh 12 14 11 1.39
Hong Kong 17 14 13 1.41
Niagara Falls 8 9 8 1.13
Mato Grosso 10 8 8 1.25
Sei Shōnagon 8 8 7 1.31
Calouste Gulbenkian 5 6 5 1.20
Diógenes Laércio 7 6 6 1.17
Treze Quelónias 6 6 6 1.00
Hockey League 4 5 4 1.25
Racionales MC's 6 5 5 1.20
Ahura Mazda 5 5 5 1.00
Campeones Ouropeus 6 5 5 1.20
Olena Teliha 4 5 4 1.25
libre-de l-cuntesto 5 5 5 1.00
Tel Abib 3 4 3 1.33
Sholen Asch 3 4 3 1.33
163 msec needed at 2021-07-28 00:03